Reinforcement learning reward function in unmanned aerial vehicle control tasks

نویسندگان

چکیده

Abstract This paper presents a new reward function that can be used for deep reinforcement learning in unmanned aerial vehicle (UAV) control and navigation problems. The is based on the construction estimation of time simplified trajectories to target, which are third-order Bezier curves. applied unchanged solve problems both two-dimensional three-dimensional virtual environments. effectiveness was tested newly developed environment, namely, environment describing dynamics UAV flight, taking into account forces thrust, inertia, gravity, aerodynamic drag. In this formulation, three tasks were successfully solved: flight given point space, avoidance interception by another UAV, organization one another. most relevant modern algorithms, Soft actor-critic, Deep Deterministic Policy Gradient, Twin Delayed Gradient used. All algorithms performed well, indicating selected function.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Unmanned Aerial Vehicle Control for Autonomous Target Following

While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process. However, real-world robotic applications often need a data-efficient learning process with safety-critical constraints. In this...

متن کامل

Unmanned Aerial Vehicle Images

The main aim of this chapter is to give to the reader a complete overview about the general context in which the thesis is positioned. In a second part, the problems faced in the following chapters are introduced. Finally, we describe the proposed solutions and the thesis structure and organization. Chapter

متن کامل

designing unmanned aerial vehicle based on neuro-fuzzy systems

در این پایان نامه، کنترل نرو-فازی در پرنده هدایت پذیر از دور (پهپاد) استفاده شده است ابتدا در روش پیشنهادی اول، کنترل کننده نرو-فازی توسط مجموعه اطلاعات یک کنترل کننده pid به صورت off-line آموزش دیده است و در روش دوم یک کنترل کننده نرو-فازی on-line مبتنی بر شناسایی سیستم توسط شبکه عصبی rbf پیشنهاد شده است. سپس کاربرد این کنترل کننده در پهپاد بررسی شده است و مقایسه ای ما بین کنترل کننده های معمو...

Autonomous Landing Unmanned Aerial Vehicle

This thesis presents the system architecture for landing an Unmanned Aerial Vehicle (UAV) from a hovering position without the intervention of a human operator. Through the use of feedback information from a height sensor, the UAV is commanded to perform controlled descent with the desired landing parameters by implementation of the flight control laws. The plant model of the system was determi...

متن کامل

Fuzzy Adaptive Control of Unmanned Aerial Vehicle for Carrying Time-Varying Cargo on Predefined Path

At present, the use of unmanned aerial vehicles (UAVs) has been increased dramatically. The reasons for this development are cheapness, smallness, simplicity, and diversity of missions. The simplicity of guidance and control of multi-rotor drones is that they are equipped with an autopilot system. This system is responsible for flying control. UAVs do not have a high weight and often have three...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of physics

سال: 2022

ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']

DOI: https://doi.org/10.1088/1742-6596/2308/1/012004